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Abstract 

Logit dynamics [Blume, Games and Economic Behavior, 1993] is a randomized best 
response dynamics where at every time step a player is selected uniformly at random and she 
chooses a new strategy according to the "logit choice function" , i.e. a probability distribution 
biased towards strategies promising higher payoffs, where the bias level corresponds to the 
degree of rationality of the agents. While the logit choice function is a very natural behavioral 
model for approximately rational agents, the specific revision process that selects one single 
player per time step seems less justified. In this paper we thus focus on the dynamics 
where at every time step every player simultaneously updates her strategy according to the 
logit choice function. We call such a dynamics the "all-logit", as opposed to the classical 
"one- logit" dynamics. 

The all-logit dynamics for a game induces an ergodic Markov chain over the set of strategy 
profiles which is significantly different from the Markov chain induced in the one-logit case. 
In this paper we first highlight similarities and differences between the two dynamics with 
some simple examples of two-player games; we then give a characterization of the class of 
games such that the Markov chains induced by the all-logit dynamics are reversible and we 
show it is a subclass of potential games; finally, we analyze the mixing time of the all-logit 
dynamics for a well-known coordination game. 
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1 Introduction 



This paper considers the classical game-theoretic scenario of n selfish players, each one with a 
set of possible actions or strategies trying to maximize her own payoff or utility. The utility 
obtained by a player depends not only on her strategy but also on the strategies adopted by 
the other players. Of this classical scenario, we are interested in the dynamics by which players 
update their own strategies eventually bringing the system to reach a stable state. Any such 
dynamics can be seen as composed of two ingredients: 

• Selection rule: by which the set of players that update their strategy is determined; 

• Update rule: by which the selected players update their strategy. 

Roughly speaking, the classical notion of a Nash equilibrium can be seen as the stable state of 
the dynamics that composes the best response with a selection rule that selects one player at 
the time. In the best response update rule, the selected player picks the strategy that, given 
the current strategies of the other players, guarantees the highest utility. 

In this paper, we study a specific class of randomized update rules called the logit choice 
function [12} [16] which is a type of noisy best response that models in a clean and tractable 
way the limited knowledge (or bounded rationality) of the players in terms of a parameter /3 
(in similar models studied in Physics, f3 is the inverse of the temperature). Intuitively, a low 
value of /3 (that is, high temperature) models the situation where players choose their strategies 
"nearly at random"; a high value of (3 (that is, low temperature and entropy) models players 
that "almost surely" play their best response; that is, they pick the strategies yielding higher 
payoffs with higher probability. The logit choice function can be coupled with different selection 
rules so to give different dynamics. For example, in the logit dynamics [7] at every time step 
a single player is selected uniformly at random and the selected player updates her strategy 
according to the logit choice function. The remaining players are not allowed to revise their 
strategies in this time step. One of the appealing features of the logit dynamics is that it 
naturally describes an ergodic Markov chain. This means that the underlying Markov chain 
admits a unique stationary distribution which we take as solution concept. This distribution 
describes the long-run behavior of the system (which states appear more frequently over a long 
run) . The interplay between the noise and the underlying game naturally determines the system 
behavior: (i) As the noise becomes "very large" the equilibrium point is "approximately" the 
uniform distribution; (ii) As the noise vanishes the stationary distribution concentrates on so 
called stochastically stable states which, for certain classes of games, correspond to pure Nash 
equilibria. 

A distinctive feature of our work is the focus on the mixing time of the underlying Markov 
chain; that is, the time necessary to reach the stationary distribution. Our general approach 
consists in studying the long-term behavior of a system of n selfish players by looking at the 
stationary distribution of Markov chain induced by the specific dynamics that we take as de- 
scriptive of the behavior of selfish players with bounded rationality. This conceptual framework 
though is meaningful only if stationarity is reached quickly (in our approach "quickly" means 
in time polynomial in the number of players) so that observables measured at stationarity are 
descriptive of the system, On the other hand, if too long (that is, time exponential in the num- 
ber of players) is taken to reach equilibrium then one can say that the system is never in the 
stationarity and measures taken at stationarity do not say anything about the system. 

While the logit choice function is a very natural behavioral model for approximately rational 
agents, the specific selection rule that selects one single player per time step seems less justified. 
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Ideally, the selected equilibrium point should be the "natural" result of the game and of the 
players rationality level, and not of the selection rule. Therefore a natural question arises 

What happens if concurrent updates are allowed? 

For example, it is easy to construct games for which the best response converges to a Nash 
equilibrium when only one player is selected at each step and does not converge to any state 
when more players are chosen to concurrently update their strategies. Motivated by this, we 
study a dynamics in which all players update their strategies at every time step and the update 
rule is the logit choice function. We call such a dynamics all-logit, as opposed to the classical 
( one-) logit dynamics in which only one player at a time is allowed to move. Admittedly, this 
dynamics might be considered even less natural than the logit dynamics. However, it can be 
regarded as the "most different" version of noisy best response dynamics and thus as a sort of 
"worst-case" scenario in terms of robustness of logit dynamics. Furthermore, this dynamics has 
been also considered in pQ under the name of "instantaneous learning". The reason for this 
term is to consider the scenario in which players revise their strategies whenever they learn that 
another player has changed strategy. Note also that concurrent updates can be the result of 
players updating their strategies based on outdated information. 

Our contribution. Our goal is to understand the effect of the concurrent selection rule on 
the logit choice function and to study this dynamics (concurrent move + logit choice function) 
for every possible inverse noise f3 and see which properties of the original (one-)logit dynamics 
are preserved, and which are not. We compare the stationary distribution and the mixing time 
of the two dynamics (one-logit vs all-logit) for the same game. 

As a warm-up, we discuss a few classical two-player games for which (1) the stationary dis- 
tributions of the one-logit and of the all-logit are the same but the mixing times are significantly 
different, (2) the stationary distributions are different but the mixing times are the same, or 
(3) the stationary distributions and the mixing times are essentially the same. In particular 
one cannot infer that the two dynamics converge to the equilibrium in approximately the same 
time if the equilibrium is the same. Conversely, it can happen that the two dynamics converge 
in (asymptotically) the same time though to very different equilibria. 

We then study reversibility of the dynamics, an important property of stochastic processes 
which is also useful to obtain explicit formulas for the stationary distribution. We characterize 
the class of games for which the all-logit dynamics (that is, the Markov chain resulting from 
the all-logit dynamics) is reversible as a proper (though natural) subclass of potential games 
that we name social potential games. Social potential games generalize graphical games (games 
that are played by the nodes of a network and each node plays the same game with all of its 
neighbors). This class of games includes the games used to model the diffusion of technology 
in a social network [TTJ, [18]. As a by-product, one obtains that the all-logit dynamics of two- 
player potential games are reversible, while not all potential games result in a reversible all-logit 
dynamics. This is to be compared to the well-known result saying that one-logit dynamics of 
every potential game is reversible with stationary distribution being the Gibbs measure [7]. 
The Gibbs measure gives an intuitive characterization of the most "likely" states as those with 
minimal potential. One of the tools we develop for our characterization yields a closed formula 
for the stationary distribution of reversible all-logit dynamics. 

Finally, we give the first bounds on the mixing time of the all-logit. We start by giving a 
general upper bound on the mixing time of the all-logit in terms of the cumulative utility of the 
game. We then look at the well-known n-player Ising model on the complete graph (also called 
the Curie- Weiss model in Statistical Physics) and derive an upper bound on the mixing time 
that is tighter than the one obtained from our general upper bound. We complement the upper 
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bound for the Ising model on the complete graph with a lower bound. The two bounds show 
that the mixing time is constant for /3 = 0(l/n 2 ), polynomial in n for f3 = 0(logn/n 2 ), and 
exponential for j3 = Q(l/n). The mixing time for [3 between logn/n 2 and 1/n is still open. 

Related works. The all-logit dynamics for strategic games has been studied by Alos-Ferrer 
and Netzer in pQ which can be seen as complementary to our work. Specifically, in pQ the 
authors study general selection rules (including the selection rule of the all-logit) and investigate 
conditions for which a state is stochastically stable under the dynamics resulting from combining 
the selection rule with logit-choice function. A stochastically stable state is a state that has 
non-zero probability as the noise vanishes. On the contrary, we focus on a very particular 
selection rule but consider the whole range of values of the noise. Our bounds are quantitative 
in the sense that we express the stationary distribution and the mixing time as a function of 
the inverse noise /3. One of the results in [1] says that, for a class of selection rules that does 
not include the one of the all-logit, the set of stochastically stable states of the corresponding 
dynamics is a subset of Nash equilibria. They also derive an explicit formula for the stationary 
distribution of logit dynamics with general selection rules and a consequent characterization of 
stochastically stable states. 

In contrast, the (one-)logit dynamics has been actively studied starting from the work of 
Blume [7] that showed that for 2x2 coordination games, the long-term behavior of the system is 
concentrated in the risk dominant equilibrium (see [9]). The mixing time and the metastability 
of the (one-)logit dynamics for strategic games has been studied by [H [5]- Much work 
has been devoted to the study of the (one-)logit for graphical coordination games as they are 
used to model the spread of a new technology in a social network [81 [18]. A general upper 
bound on the mixing time of the (one-)logit graphical coordination game is given by Berger 
et al. [B]. Montanari and Saberi [T5] instead studied the hitting time of the highest potential 
configuration in graphical coordination games and relate this quantity to a connectivity property 
of the underlying network. 

2 Definitions 

In this section we briefly recall some standard game-theoretic notation and we formally define 
the Markov chain induced by the all-logit dynamics. 

Strategic games. Let Q = ([n], S\, . . . , S n , U\, . . . , u n ) be a finite normal- form strategic game. 
The set [n] = {1, . . . ,n} is the player set, Si is the set of strategies for player i £ [n], S = 
Si x S2 x • • • x S n is the set of strategy profiles and Ui : S — > R is the utility function of player 
i £ [n]. 

We adopt the standard game-theoretic notation and denote by S-i the set S-i = S± x . . . x 
Si-i x S{ + i x . . . S n and, for x = (x\, . . . , Xj+i, • • • , x n ) £ S-i and y £ Si, we denote by 
(x,y) the strategy profile (xi, . . . , y, x i+1 , ... ,x n ) £ S. 

Potential games |14| constitute an important class of games. We say that function <3? : S — > 
R is an exact potential (or simply a potential) for game Q if for every i £ [n] and every x £ S-i 



for all y,z £ Si. A game Q that admits a potential is called a potential game. 

Logit choice function. We study the interaction of n players of a strategic game Q that 
update their strategy according to the logit choice function [T2J \7\ [16] described as follows: 



Ui(x, y) - Ui(x, z) = $(x, z 



) - $(x,y) 
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from profile x £ S player i £ [n] updates her strategy to y £ Si with probability 

e /3«j(x_i,y) 

<*(y|x) = ^_ z y (1) 

In other words, the logit choice function leans towards strategies promising higher utility. The 
parameter (3 ^ is a measure of how much the utility influences the choice of the player. 



All-logit. In this paper we consider the all-logit dynamics, by which all players concurrently 
update their strategy. Most of the previous works have focused on dynamics where at each step 
one player is chosen uniformly at random and she updates her strategy by following the logit 
choice function. We call that dynamics one-logit, to distinguish it from the all-logit. 

The all-logit dynamics induces a Markov chain over the set of strategy profiles whose tran- 
sition probability -P(x, y) from profile x = (x±, . . . , x n ) to profile y = (y\, . . . , y n ) is 

P( x ,y)=l[o- i (y i \x) = — — « Ui{x _ iiZ) ■ (2) 

i = i 1 ii=i Z-izeSi c 

Sometimes it is useful to write the transition probability from x to y in terms of the cumulative 
utility of x with respect to y defined as C7(x,y) = £V tt,(x_j, yi). Indeed, by observing that 

n n 

i=i zeSi zeS«=i 



we can rewrite ([2]) as 



P(x ' y > = -D?^-- (3) 



where D(x) = Ylzes e /3C/ ^ x,z - ) . For a potential game Q with potential we can define the 
cumulative potential of x with respect to y as ^(x, y) = £\ $(x_j, yi). Simple algebraic ma- 
nipulations show that, for a potential game, we can rewrite the transition probabilities in ([3]) 

as 

e -/3*(x,y) 

^(x,y) = ^ r , 

where T(x) = £ ze5 e ~^' z \ 

It is easy to see that a Markov chain with transition matrix ([2]) is ergodic. Indeed, for 
example, ergodicity follows from the fact that all entries of the transition matrix are strictly 
positive. 



Mixing time. An ergodic Markov chain has a unique stationary distribution tt and for every 
starting profile x the distribution P*(x, ■) of the chain at time t converges to ir as t goes to 
infinity. The mixing time is a measure of how long it takes to get close to the stationary 
distribution from the worst-case starting profile 

imix(e) = inf {t G N : ||P*(x, •) - 7r|| TV ^ e for all x G S} , 

where ||P*(x, •) — 7t||tv = \ Yly&s l-f*( x > y) ~~ ^(y)! i s the total variation distance. We will 
usually write t m i x for t m ; x (l/4). We refer the reader to [11] for a more detailed description of 
notational conventions about Markov chains and mixing times. 
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3 Warm-up: two-player games 



A first natural question is whether there is any relation between the one-logit and the all-logit in 
terms of stationary distribution and mixing time. In this section we study stationary distribution 
and mixing time of the all-logit dynamics for three simple two-player games and we compare 
them with the corresponding results for the one-logit case. The analysis of the three games 
highlights that, in some sense, "everything can happen". Indeed, in the Matching Pennies 
example the stationary distribution of the all-logit dynamics is the same as the stationary 
distribution of the one-logit one, but mixing times are completely different; in the two-site Ising 
example the stationary distribution is different from the one-logit case, but the mixing time is 
asymptotically the same; in the Prisoner's Dilemma example both stationary distribution and 
mixing time have the same qualitative behavior as in the one-logit case. In addition we observe 
that, for 2x2 games, an important observable of the one-logit dynamics (the magnetization) is 
preserved when we look at the all-logit dynamics. 

Matching Pennies. We start by considering the classical matching pennies game. Here the 
first player wants to coordinate while the second player prefers not to. The payoff matrix is 



H 



H 
T 



+1,-1 


-1,+1 


-1,+1 


+1,-1 



According to ([T]) and ([2]), the transition matrix of the Markov chain induced by the all-logit 
dynamics is 



P = 





HH 


HT 


TH 


TT \ 


HH 


p(l - p) 


(1-pf 


P 1 


p(l-p) 


HT 


P 2 


p(l -p) 


p(l -p) 


(1-P) 2 


TH 


{l-pf 


p(l -p) 


p(l -p) 


2 

P 


TT 


p(l-p) 


o 

p 


{l-pf 


p(l-p) ) 



HH 



TH. 




HT 



TT 



where p = 1/(1 + e 2/3 )Q 

The transition matrix is doubly-stochastic so the uniform distribution is stationary (as for 
the one-logit case) and it is easy to prove that the mixing time is G (1/p) = (e 2 ^ 3 ) (while in 
the one-logit case it was upper bounded by a constant independent of /3). 

Notice that the black arrows in the picture draw a cycle over the four states, so the chain 
becomes more and more periodic as /? goes to infinity. 



Two-site Ising game. This game models the interaction of two particles each one having a 
possible magnetization (thus two states are possible, + and — ) and the game assigns a higher 
utility to profiles in which the two players have the same magnetization. For this example, we 
look at the case in which there is no external magnetic field and thus the utilities of profiles 
+- 1- and are equal. This game is a special case of a two-player coordination game (the class 

In the pictures of this section, black arrows indicate probabilities going to 1 while blue and red arrows indicate 
probabilities going to 0, as f3 goes to infinity. Red arrows go to zero faster than blue ones. 
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of games studied by [7j ) with no risk-dominant strategy [S] . The utilities of the two-site Ising 
game are the following 

| + 

+ ' 



+1,+1 


-1,-1 


-1,-1 


+1,+1 



It can be easily seen that the game is a potential game with potential such that 
$(++) = $( ) = -1 and $(+-) = $(-+) = 1. 

It is well known that the stationary distribution of the one-logit of a potential game is the Gibbs 
distribution. For the the two-site Ising game, the Gibbs distribution assigns to x G {+, — } 2 
probability e~^^^ /Z , where Z = ^ xe | + 12 e - ^*) is the partition function. 

The transition matrix of the Markov chain induced by the all-logit dynamics is 



P 





++ 






\ 


++ 


(\-pY 


P{l-P) 


Pil-P) 


P 2 


+- 


p(i-p) 


P 2 


il-pf 


Pil-p) 


-+ 


p(i-p) 


(1-P) 2 


P 2 


Pil-p) 
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Pil -p) 


il-pf J 




where p = 1/(1 + e 213 ) again. 

The fact that the transition matrix is doubly-stochastic implies that the stationary distribution 
of the all logit is uniform. The chain is reversible and the mixing time is 0(l/p) = G (e 2/3 ) (as 
in the one-logit case). 

Prisoner's Dilemma. Finally, we consider the following payoff matrix of a general two-player 
symmetric game 

_ C N 

C 



N 



a, a 


c, d 


d, c 


b,b 



In order for this payoff matrix to model the Prisoner's Dilemma (C= "Confess", A r ="Not con- 
fess") we want: 

1. a > d so that CC is a Nash equilibrium; 

2. b < c so that ./V./V is not Nash equilibrium; 

3. 2a < c + d < 2b so that NN is the social optimum and CC is the worst social profile. 
The transition matrix of the Markov chain induced by the all-logit dynamics is 

CC NC. 



( 




CC 




CN 


NC 


NN \ 




CC 




il-pf 




pil-p) 


pil-p) 






CN 


(1 


-p)(l- 


q) 


pil - q) 


qii-p) 


pq 




NC 


(1 


-p)(l- 


q) 


qii-p) 


pil - q) 


pq 


V 


NN 




(i-<z) 2 




qii-q) 


g(i - q) 


) 
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where p = 1/(1 + e^ a ~ d ^) is the probability a player does not confess (N) at next step given 
the other player is currently confessing (C) and q = 1/(1 + e^ c ~ b ^) is the probability a player 
does not confess (N) at the next step given the other player is currently not confessing (N). 
From conditions 1 and 2, it follows that both p and q go to as (3 goes to infinity. Notice that 
all the black arrows in the picture point at state CC. 

Observe that if p = q (i.e. if a — d = c — b) then the transition matrix becomes 



/ 



CC 



CN 



NC NN \ 





CC 


(1 


-v? 


p(l 


-p) 


p(l 


-p) 


<> 

p z 


p = 


CN 


(1 


-v? 


p(l 


-p) 


p(l 


-p) 


p 2 




NC 


(1 


-v? 


p(l 


-p) 


p(l 


-p) 


p 2 




\ NN 


(1 


-v? 


p(l 


-p) 


p(l 


-p) 


p 2 ) 



In this case the stationary distribution is ir{CC) = (1 — p) 2 , ir(NN) = p 2 and tt(CN) = 
ir(NC) = p(l — p). Moreover, the transition probability to a state does not depend on the 
previous state, thus after just one step the chain is stationary. In the general case the stationary 
distribution is 



tt(CC) 



n(NC) = tt(CN) 



p(l 



(1 + p 



tt(NN) 



P 



(1+p 



The stationary probability of state CC goes to 1 as /3 goes to infinity, and it is easy to prove 
that the mixing time is upper bounded by a constant independent of /3. 



Two-player potential games. As we have seen from the examples above the stationary 
of the one-logit and of the all-logit are markedly different. Notice though that for both the 
examples above that are potential games (Ising game and Prisoner's dilemma) the all-logit 
dynamics is reversible. This is no coincidence and we shall see that the all-logit is reversible for 
all two-player potential games. Actually, in Section [U we will characterize the class of games 
for which the all-logit dynamics is reversible. It turns out that it is a subclass of potential 
games, which we will call social potential games, and that includes all two-player potential 
games. In this section, we discuss some further relations between the one-logit and the all-logit 
of two-player games, starting from the magnetization. 

Let Q be a game in which each player has two strategies, denoted by and 1. We define the 
magnetization M(x) of profile x of Q as the difference between the number of players that play 
and the number of players that play 1 in x. The expected magnetization Ei(M) of the one-logit 
of Q is the expected value of M(x) for x distributed according to the stationary distribution of 
the one-logit. Similarly, the expected magnetization E^(M) of the all-logit of Q is the expected 
value of M(x) for x distributed according to the stationary distribution of the all-logit. 

In several games the magnetization has an interesting interpretation. The magnetization 
in the Ising game is naturally interpreted as the magnetization that one expects to observe 
once stationarity is reached. For the simple case of no external magnetic field that we have 
discussed above, it is easy to see that the expected magnetization is both for the one-logit 
and for the all-logit. For another example, coordination games on a network have been used to 
study the diffusion of a new technology (or of a new social norm, in general) in a social network 
(see [E]). Here each player can choose whether to stay with the old technology (corresponding 
to strategy 0) or switch to the new technology (corresponding to strategy 1). In this context 
the magnetization is simply the difference between the number of users that stayed with old 
technology and the number of adopters of the new technology. 
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We next show that for all 2 x 2 potential games Q, it holds that Ei(M) = E^(M). A 
2x2 game Q is a two-player game in which each player has two strategies, called and 1. 
Let $ : {0, l} 2 -flbea potential for the 2 x 2 game Q. For x G {0, l} 2 set vri(x) = 
and Z\ = ^ xe {o i}2 tti(x). As we have already recalled, the Gibbs measure 7Ti(x)/Zi is the 
stationary distribution of the one-logit of a potential game with potential $ (see [7]). The 
expected magnetization Ei(M) of the one-logit of Q is thus 

^ tti(OO) - vri(ll) 

Ei(M) = 2 • u ; v 
Zi 

To study the magnetization of the all-logit, we set ^(x) as 

vr A (00) = (tti(OO) + 7r x (01)) • (vn(00) + vn(10)) 
vr A (01) = (tti(OO) + tti(OI)) • (tti(01) + tq(11)) 
vr A (10) = (tti(OO) + 7r x (10)) • (tt^IO) + tti(11)) 

vta(H) = (7Ti(01) + 7ri(ll)) ■ (7ri(10) + 7Tl(ll)) 

and Za = Sxg{o i} 2 ^(x) is the partition function of the all-logit. From Theorem 14.7} it is not 
difficult to see that, for a 2 x 2 potential game, ita(x-)/Za is the stationary distribution of the 
all-logit. The expected magnetization Ea(M) of the all-logit of Q is thus 

E,(M) = 2.«W-M11) 



Z A 



We observe that 



Z A = (vr A (00) + ^a(OI)) + (M10) + ^4(11)) 

= (tti(OO) + tti(01)) • Z a + (tti(10) + 7Ti(ll)) • Zi 

= zl 

That is, the partition function of the all-logit is the square of the partition function of the 
one-logit. On the other hand, 

jta(OO) - 7^(11) = (7ri(00)+7ri(01))-(7ri(00) + 7ri(10)) 

- (7Ti(01) + TTl(ll)) • (TTl(lO) + TTl(ll)) 
= (7Tl(00) -tti(II)) -Z X 



and thus 



E ,(M) _ 2 .^(00)-^(ll) 

Za 



= 2 



= 2 



(7n(00)-7ri(ll))-Zi 
Z A 

7Ti(00) -TTl(ll) 



Zl 

Ei(M) . 



4 Reversibility and stationary distribution 

Reversibility is an important property of Markov chains and, in general, of stochastic processes. 
Roughly speaking, for a reversible Markov chain the stationary frequency of transitions from a 
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state x to a state y is equal to the stationary frequency of transitions from y to x. It is easy to 
see that the one-logit for a game Q is reversible if and only if Q is a potential game. As we shall 
see, this does not hold for the all-logit. Indeed, we will prove that the class of games for which 
the all-logit is reversible is a subclass of potential games: those games that are the "sum" of 
two-player potential games. 

4.1 Reversibility criteria 

Let Ai be a Markov chain with transition matrix P and state set S. Ai is reversible with 
respect to a distribution tt if, for every pair of states x, y £ S, the following detailed balance 
condition holds 

7r(x)P(x, y) = ir(y)P(y,x) . 

It is easy to see that if Ai is reversible with respect to tt then tt is also stationary. The Kol- 
mogorov reversibility criterion allows us to establish the reversibility of a process directly from 
the transition probabilities. Before stating the criterion, we introduce the following notation. A 
directed path T from state x £ S to state y £ S is a sequence of states (xq, xi, . . . , xg) such that 
xq = x and X£ = y. The probability P (T) of path F is defined as P (r) = Yij=i -P( x j-i> x j)- 
The inverse of path T = (xo,xi, . . . , xg) is the path T -1 = (xg, a^-i, . . . , xq). Finally, a cycle C 
is simply a path from a state x to itself. We are now ready to state Kolmogorov's reversibility 
criterion (for a proof see, for example, [10]). 

Theorem 4.1 (Kolmogorov's Reversibility Criterion). An irreducible Markov chain Ai with 
transition matrix P and state space S is reversible if and only if for every cycle C it holds that 

P (C) = P (C" 1 ) . 

We have the following lemma. 

Lemma 4.2. Let Ai be an irreducible Markov chain with transition probability P and state 
space S. M. is reversible if and only if for every pair of states x,y S S, there exists a constant 
c X) y such that for all paths T from x to y, it holds that 

p(r-i) ~ x ' y ' 

Proof. Fix x,y E S and consider two paths, T\ and T2, from x to y. Let C\ and C2 be the 
cycles C\ = T\ o 1 and C2 = T2 o T^ 1 , where o denotes the concatenation of paths. If A4 
is reversible then, by the Kolmogorov Reversibility Criterion, P (Ci) = P (C2) . On the other 
hand, 

p (Ci) = p (r x ) • p (rj 1 ) and p (c 2 ) = p (r 2 ) • p (r^ 1 ) . 

Thus 

PjTi) P(r 2 ) 
p (rr 1 ) p (r,- 1 ) ■ 

For the other direction, fix z 6 S and, for all x 6 S, set tt(x) = c ZjX /Z, where Z = ^2 x c ZjX is 
the normalizing constant. Now consider any two states x, y £ S of Ai, let T\ be any path from 
z to x and and set r 2 = T\ o (x,y) (that is, T 2 is F\ concatenated with the edge (x,y)). We 
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have that 

g_(ri) P(r 2 ) 
p (rr 1 ) " p (r- 1 ) 

p(r^) ' P(r!)-p(x, y ) 
P(y,x) 

P(x,y) 



and therefore M is reversible with respect to ff. □ 

The above lemma is very useful as it also gives an expression for the stationary distribution 
7T. Specifically, if we fix a state z, then for all x, ir(x) is proportional to P(x,z)/P(z,x) 

4.2 Reversibility implies potential games 

In this section we prove that if the all-logit for a game Q is reversible then Q is a potential game. 

The following lemma shows a condition on the cumulative utility of a game Q that is neces- 
sary and sufficient for the reversibility of the all-logit of Q. 

Lemma 4.3. The all-logit for game Q is reversible if and only if the following property holds 
for every x, y, z £ S: 

C/(x, y) - U(y, x) = (V(x, z) + U(z, y)) - (lJ(y, z) + U(z, x)) . (4) 

Proof. To prove the only if part, pick any three x, y, z £ S 1 and consider paths Ti = (x, y) 
T2 = (x, z,y). From Lemma 14.21 we have that reversibility implies 

Pgi) P(r 2 ) 
p (rr 1 ) p (r 2 1) 

whence 

e /3C/(x,y) ^(y) ^ e /3C/(x,z) e /?l/(z,y) £,( y ) £,( z ) 

D(x) e P u (y>*) ~ ~D(x) P>(z) e /3C/(y,z) e /3f/(z,x) • 

which in turn implies [U 

As for the if part, let us fix state z £ S and define vr(x) = z-p'{^\) ' wriere Z is the normalizing 
constant. For any x,y£S, we have 

tt(x) _ P(z,x) P(y,z) _ e^^ z ' x ) e^- 2 ) £>(x) _ eg^*) £>(*) _ P(y,x) 
fi(y) ~ P(x,z) ' P(z,y) ~ e^i^) ' ' D(y) ~ ' D(y) ~ P(x,y) ' 

where the first equality follows from the definition of w, the second and the fourth follow from 
([3]) and the third follows from ([!]). Therefore, the detailed balance equation holds for tt and 
thus the Markov chain is reversible. □ 

We are now ready to prove that the all-logit is reversible only for potential games. 

Theorem 4.4. If the all-logit for game Q is reversible, then Q is a potential game. 
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Proof. We show that if the all-logit is reversible then the utility improvement over any cycle of 
length 4 is 0. The theorem then follows by Theorem IA.ll 

Consider circuit T = (x, z, y, w) and let i be the player in which x and z differ and let j be 
the player in which z and y differ. Then y and w differ in player i and w and x differ in player 
j. In other words, z = (x_j,?/j) = (y_j,Xj) and w = (x_j,?/j) = (y_j,Xj). Therefore we have 
that 

U(*,y) = Efc^j «fe(x) + ui(z) + Uj(w) Z7(y,x) = Eit/ij u fc(y) + u i( w ) + %( z ) 

^(x, 2 ) = Efc^i,j^( x ) + +«i(x) ^(z,y) = Efc^j u fc( z ) +«i(y) 
£%, z ) = iLk^ijMy) + u i(y) + uj(z) u(z,x) = E^,j u fc( z ) + «»(«) + %( z ) 

By plugging the above expressions into Q and rearranging terms, we obtain 

\Ui(z) - Ui(x)J + ^Uj(y) - %(z)J + (ui(-w) - u»(y)) + (uj(x) - « 3 -(w)J = 
which shows that /(T) = 0. □ 
4.3 A necessary and sufficient condition for reversibility 

In the previous section we have established that the all-logit is reversible only for potential 
games and therefore, from now on, we only consider potential games Q with potential function 
<I>. In this section we present in Theorem l4.6l a necessary and sufficient condition for reversibility 
that involves the potential and the cumulative potential. The condition will then be used in the 
next section to prove that the n-player games that are the sum of two player potential games 
are exactly the games whose all-logit is reversible. 

We start by re- writing Lemma 14.31 in terms of cumulative potential. 

Lemma 4.5. The all-logit is reversible if and only if for every x, y, z G 5; 

(x, y) - (y, x) = (*(x, z) + (z, y)) - (tf(y, z) + *(z, x)) . (5) 

We are now ready to prove a necessary and sufficient condition for reversibility that involves 
potential and cumulative potential. 

Theorem 4.6. The all-logit for Q is reversible if and only if Q has potential function $ such 
that 

#(x 5 y)-#(y,x) = (n-2)($(x)-$(y)) for every x, y G S . (6) 

Proof. Clearly ^ implies As for the other direction, we proceed by induction on the 

Hamming distance between x and y. Let x and y be two profiles at Hamming distance 1; that 
is, x and y differ in only one player, say j. This implies that (jjj,x.-j) = y and (xj,y-j) = x. 
Moreover, for i ^ j, (yj,x_j) = x and (xj,y_j) = y. Thus, 

*(x,y)-*(y,x) = ^(sdfoX-O-Sfo.y-i)) 

i 

= ($(y) - $(x)) + (n - 1) ($(x) - $(y)) = (n - 2) ($(x) - $(y)) . 

Now assume that the claim holds for any pair of profiles at Hamming distance k < n and let 
x and y be two profiles at distance k + 1. Let j be any player such that Xj ^ yj and let 



12 



z = (yj,X-j): z is at distance at most k from x and from y. Then, by (0) and by the inductive 
hypothesis, we have 

*(x,y)-*(y,x) = (*(x,z) + *(z,y)) - (tf(y,z) +*(z,x)) 

= (n - 2) ($(x) + $(z) - $(y) - $(z)) = (n - 2) ($(x) - $(y)) . 

□ 

The above lemma and Lemma 14.21 allow us to express the stationary distribution for re- 
versible all-logit dynamics in the following way. We remind the reader that T(x) has been 
defined as T(x) = E zG s e_/3 * (x ' z) - 

Lemma 4.7. Let Q be a potential game with potential function and reversible all-logit. Then 
the stationary distribution of the all-logit for Q is 

vr(x) oc e ("- 2 )^*( x ) . T(x) . (7) 

Proof. Fix any profile y. The detailed balance equation gives for every x£S 

vr(x) = P(y,x) = fl ( * M _ t(y , x ))T(x) 
7r(y) P(x,y) T(y)' 

By Lemma 14.61 we have 

vr(x) = e ("-a)/»tt • T(x) f , - . 

v ; v ; ^ e (n-2)/9*(y) . r(y) J 

Since the term in parenthesis does not depend on x the lemma follows. □ 

We end the section by making an observation that will be used for proving the main result 
in the following. 

Observation 4.8. Let & be a potential function satisfying Then for all x,y € S, 
]T (<%;, x_,) - 3(z i9 y_i)) ={h-2) (<D(x) - $(y)) , 

where h is the Hamming distance between x and y. 
4.4 Social potential games 

In this section we prove that the games whose all-logit is reversible are exactly those potential 
games whose potential can be written as a sum of two-player potentials. We call these games 
social potential games. 

A potential : S\ x • • • x S n — > R is a two-player potential if there exist u, v G [n] such that, 
for any x, y E S with x n = y u and x„ = y v we have 3>(x) = 3>(y). In other words, is a function 
of only its «-th and v-th. argument. We say that potential game Q with potential is a social 
potential game if there exist ./V two-player potentials <&i, . . . , 3>at such that $ = $i + • • • + 
It is easy to see that generality is not lost by further requiring that 1 ^ I ^ V ^ N implies 
(ui,vi) 7^ (uii,Vi>), where U[ and vi are the two players of potential 

At every social potential game Q with potential = $i + • • • + <J?jv, we can associate a 
social graph Gthat has a vertex for each player of Q and has edge (u, v) iff there exists / such 
that potential &i depends on players u and v. In other words, we can see the players of a 
social potential game as sitting at the vertices of the social graph G and each player playing a 
(possibly different) two-player potential game with each one of her neighbors. 

The following theorem holds. 
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Theorem 4.9. The all-logit of a social potential game is reversible. 

Proof. We prove that any two-player potential satisfies ([6]) and then observe that the sum of 
two potentials satisfying ([6]) also satisfies ([6]). 

Let $ be a two-player potential and let u and v be its two players. Then we have that for 
w / u,v, §(y w ,x- w ) = $(x) and that 3>(y u ,x_ u ) = $(x v ,y- v ) and $(y v ,x- v ) = 
Thus 

*(x, y) = $(y u , x_ n ) + <S>(y v , x_„) + (n - 2)$(x) 

and 

*(y,x) = $(x„,y_ 1 ,) + $(x„,y_ u ) + (n-2)$(y) 
= $(y u , x_ n ) + x_„) + (n - 2)$(y) 

□ 

Next we prove that if an n-player potential $ satisfies flUJ) and thus the all-logit is reversible 
then <3> can be written as the sum of at most N = Q) two-player potentials, $i, . . . , $j\r- We 
do so by describing an effective procedure that constructs the N two-player potentials. 

Without loss of generality, we assume that each strategy set Si includes strategy and 
denote by the strategy profile consisting of n O's. Moreover, we fix an arbitrary ordering 
(ui, Vi ),..., (ujv, v jv) of the N unordered pairs of players. For a potential $ we define the 
sequence #o ; ■ ■ ■ , $N of potentials as follows: $0 = & and, for i = 1, . . . , iV, set 

0i = - $i (8) 

where, for x G 5, <&i(x) is defined as 

$i(x) = i9i_i(a; Ui ,a; Vi ,0_ um ) . 

Observe that, for j = 1, . . . , iV, 3>j is a two-player potential and its players are Uj and Vi. By 
summing for i = 1, . . . , N in ([8]) we obtain 

AT JV-1 AT 

£* = £*<-E*«. 

i=l i=0 i=l 

Thus 

JV 

i=i 

The next two lemmas prove that, if <I> satisfies ([U]), then i9tv is identically zero. This implies 
that <E> is the sum of at most ./V non-zero two-player potentials and thus a social potential game. 

A ball B(r,x) of radius r ^ n centered in x E S is the subset of S containing all profiles y 
that differ from x in at most r coordinates. 

Lemma 4.10. For any n-player potential function <3? and for any ordering of the pairs of 
players, i?jv(x) = for every x G B(2, 0). 

Proof. We distinguish three cases based on the distance of x from 0. 
x = 0: for every i ^ 1, we have 

M0) = 0i-i(O) - * f (0) = tfi-i(O) - ^_i(0) = . 

x is at distance 1 from 0: That is, there exists u G [ra] such that x = (x u , 0_ u ), with x u ^ 0. 
Let us denote by t(u) the smallest t such that the i-th pair contains u. We next show that for 
i ^ t(u), ^i(x) = 0. Indeed, we have that if u is a component of the i-th pair then 

0<(x) = <?i_!(x) - $i(x) = f _i(x) - i?i_i(x) = ; 
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On the other hand, if u is not a component of the i-th. pair then 

0<(x) = ^_i(x) - $i(x) = ^_l(x) - ^_i(0) = ^_i(x) ; 

x is at distance 2 from 0: That is, there exist u and v such that x = (x u , x v , 0_ u „), with x u ,x v 7^ 
0. 

Let t be the index of the pair (u, v). Notice that t ^ t(u),t(v). We show that #t(x) = and 
that this value does not change for all i > t. Indeed, we have 

t (x) = t _i(x) - $ t (x) = tf t _i(x) - i? t _i(x) = 0; 

If instead neither of u and u belongs to the i-th pair, with i > t, then we have 

0i(x) = i?<_i(x) - $i(x) = ^_i(x) - ^_x(0) = f _i(x) ; 

Finally, suppose that the i-th pair, for i > t, contains exactly one of u and v, say u. Then we 
have 

l? < (x)= 1 ? i _i(x)-*i(x)=i? i _i(x)-i? i _i(x u ,0_ tl ). 

We conclude the proof by observing that i — 1 > t ^ t{u) and thus, by the previous case, 
&i-i(x u ,0- u ) =0. □ 

The next lemma shows that if a potential $ satisfies (JBJ) and is constant in a ball of radius 
2, then it is constant everywhere. 

Lemma 4.11. Let & be a function that satisfies If there exist x £ S and c£R such that 
^(y) = c / or every y £ 5(2, x), i/ien 3>(y) = c /or every y 6 S. 

Proof. Fix h > 2 and suppose that <3?(z) = c for every z S i?(/i — l,x). Consider y € B(h,x) \ 
B(h — l,x) and observe that (yj,x_j) G 5(/i — l,x) and (xj,y_j) G 5(/i — l,x) for every i 
such that Xi 7^ From Observation 14.81 we have (h — 2)($(y) — ( I ) (x)) = which implies 

$(y) = $( X ) = C. □ 

We can thus conclude that if the all-logit of a potential game Q is reversible then ^ is a 
social potential game. By combining this result with Theorem 14.41 and Theorem 14.91 we obtain 

Theorem 4.12. The all-logit of game Q is reversible if and only if Q is a social potential game. 



5 Mixing time 

The all-logit dynamics for a strategic game has the property that, for every pair of profiles x,y 
and for every value of /3, the transition probability from x to y is strictly positive. In order to 
give upper bounds on the mixing time, we will use the following simple well-known lemma (see 
e.g. Theorem 11.5 in |13j). 

Lemma 5.1. Let P be the transition matrix of an ergodic Markov chain with state space 
For every y £ O let us name a y = mm{P(x,y) : x G £1} and a = ^2 ye QCt y . Then the mixing 
time of P is t mix = (D(l/a). 

In this section we first give an upper bound holding for every strategic game. We will then 
focus on a specific game, the Ising game on the clique (also known as the Curie- Weiss model in 
statistical physics) and we will give a refined version of the upper bound and a lower bound. 
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For a strategic game G, in Section [2] we defined the cumulative utility function for the ordered 
pair of profiles (x, y) as f7(x, y) = Ya=i n i( x -«> Hi)- Let us name AU the size of the range of 
U, 

AU = max{[7 (x, y) : x, y G S} — mm{U (x, y) : x, y G S} . 

By using Lemma 15.11 we can give a simple upper bound on the mixing time of the all-logit 
dynamics for Q function of f3 and AU. 

Theorem 5.2 (General upper bound). For every strategic game Q the mixing time of the 
all-logit dynamics for Q is (e^ Af/ ) . 

Proof. Let P be the transition matrix of the all-logit dynamics for Q and let x,y G S be two 
profiles. From ([3]) we have that 

e PU(*,y) i i 



Hence for every y G S it holds that 



e^ AU 

and a = X^yes Q y ^ e~^ AU . The thesis then follows from Lemma 15 .11 □ 
5.1 Ising game on the clique 

In this section we prove upper and lower bounds on the mixing time of the all-logit dynamics 
for the Ising game on the clique. In such a game, every player has two strategies, +1 and — 1, 
and the utility of player i G [re] is the sum of the number of players playing the same strategy 
as i, minus the number of players playing the opposite strategy, i.e. the utility of player i G [n] 
at profile x = (xi, . . . , x n ) G { — 1, +l} n is 

1Aj — ^ *^ j ' 

It is easy to see that such a game is a potential game with potential function 

$(x) = - XiXj . 

Due to the high level of symmetry of the game, the potential of a profile x depends only on the 
number of players playing dbl. If we name fc x := Y17=i Xi ^ ne magnetization of x we can write 
the potential of x as 

v ; 2 

The upper bound. Observe that, for the Ising model on the clique we have AU = 2n(n — 1), 
hence by using Theorem 15.21 we get directly that 



( e 2 P n{n-l)\ (g) 



Hence it follows that mixing time is 0(1) for (3 = 0(l/re 2 ) and it is 0(poly(n)) for f3 = 
0(log n/n 2 ). 

In what follows we show that factor "2" at the exponent in ([9]) can be removed and that a 
slightly better upper bound can be given for j3 > log n/n. 
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Lemma 5.3. For every x,y£fi it holds that 

P( X ,y) ^ ^+IM)/2 (1 _ g) (n-|fe y |)/2 

where 

1 



1 + e 20(n-l) ' 



Proof. Consider a profile y 6 { — l,+l} n and let Ay be its magnetization. Remember that the 
number of players playing +1 and — 1 in y can be written as n+ky and n 2 fcy , respectively. If y has 
positive magnetization A; y > 0, i.e. if the number of players playing +1 is larger than the number 
of players playing —1, then the profile that minimizes -P(x, y) is profile x_ = (— 1, . . . , —1) where 
every player plays —1. If we name 



q 



e -/9(n-l) + e /?(n-l) 1 + e 2/9(n-l) 

the probability that a player in x_ chooses strategy +1 for the next round, we have that 

n+fcy n — ky 

On the other hand, if y has negative magnetization k y < 0, -P(x, y) is minimized when x = 
x + = (+1, . . . , +1) and, since q is also the probability that a player in x + chooses strategy —1 
for the next round, we have that 



P(*+,y) = q 2 (1-9) 2 

and the thesis follows. □ 
Now we can give an upper bound on the mixing time by using lemmata 15.11 and 15.31 

Theorem 5.4 (Upper bound). The mixing time of the all-logit dynamics for the Ising model 
on the clique is 

^>mix — ^ ( ne^ 

If j3 ^ logn/ro the mixing time is 



O 



ne /3n 2 



Proof. From Lemma 15.31 it follows that for every y £ { — l,+l} n we have 

a y = min{P(x,y) | x G {-1, +1}™} > q ( n +\ k y\)/ 2 (l - g )(«-IM)/2 _ 

Hence 

a= Yl a y> E g (n+|feyl)/2 (l-g) {n - |fcyl)/2 . (10) 
ye{-l,+l} n ye{-i,+i} n 

Now observe that there are ( n -k ) profiles with magnetization k, and since q ^ 1/2, the largest 

2 

terms in ()10p are the ones with magnetization as close to zero as possible. In order to give a 
lower bound to a we will thus consider only terms with magnetization k = 0, when n is even, 
and terms with magnetization k = ±1, when n is odd. 

Case n even: If we consider only profiles with magnetization k = in (|10p we have that 



a »("/2> (i -* /2 - 
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By using a standard lower bound for the binomial coefficient (see e.g. Lemma 9.2 in |13| ) we 
have that 

f n \ 2 n 
\n/2J ' 

As for [q(\ — q)] n ^ 2 we have that 

q(i-q) = 



re/2/ n + 1 



1 + e 2/3(n-l) l + e -2/3(n-l) 
1 

e 20(n-l) + 2 + e -2/3(n-l) 
1 



- 1 ) (1 + 2e- 2 / 3 ( n - 1 ) + e" 4 / 3 ^" 1 )) 



(11) 



Now observe that for every j3 ^ we can bound 1 + 2e~ 2 ^ n ^ + e^™- 1 ) ^ 4. Thus we have 
that 

[q(l - q)] n / 2 > ^V^T • ( 12 ) 

Hence 

B V ,,«/9 - 1 



(n + i) e / 3n (™-i) ' 
And by using Lemma 15.11 we have 

t mix = O (ne^"- 1 )) . 
If f3 is large enough, say f3 ^ log n/n, in (fTT]) we can bound 

1 + 2e~ 2 ' 3 ("- 1 ) + e" 4 ^"- 1 ) ^ 1 + i . 

n 

Thus, in this case we have that 

" 9)]n/2 ^ * , ^(n/2) > fln(n-l) T ' ^ 

e /3n(n-i) n + 1/nY ' ' e pn[n 1 > • ye 



on 

Hence a > (TO+1)e i/ 2+/3tl(T1 -i) and 



(9 



ne /3n(n~l) 



2 



it 



Case re odd: If we consider only profiles with magnetization ±1 in ()10p we get 



n \ n + 1 ,^ % n-l 

n+1 



n+1 g— (1-q)- 



Now observe that 



^- = e-«- 1 ) and [ " ] > 1 2 " 



^ ' 2 n + 1 

By using bounds (JED and CED for [q(l - q)] n / 2 we get t mix = O (ne^^A for every /3 ^ 
and t mix = O ( ne ^~ 1) ) for /3 > log n/n. □ 
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The lower bound. A key function for the all-logit dynamics for a potential game with 
potential function <3? is 

T(x,y) = (n-2)$(x) + *(x,y) 
where ^(x, y) = Ya=i ^i^-i^Ui)- Indeed, from ((JJ) it follows that for a "social potential game" 

7T(x)P(x,y) = 1 /3((n-2)*(x) + *(x, y )) 

In order to give a lower bound on the mixing time, we first show that, for the Ising model on 
the clique, T(x, y) is symmetric and can be written as a function of the magnetization of the 
two profiles and the Hamming distance between them. 

Lemma 5.5. Let x, y € {— l,+l} n be two profiles with magnetization fc x and k y respectively 
and let <i Xj y be their Hamming distance, i.e. the number of players where they differ. Then 

(n - 2)$(x) - ^(x, y) = k x k y + 2d x , y - n . 

Proof. We already know that 3>(x) = n 2 fcx . In order to evaluate ^(x, y) = Ya=1 ^K x -i>2/i) ^ 
us name a, b and c as follows 

a = #{i £ [n] : x l = ; 

b = #{i G [n] : x { = +1, y-i = -1} ; 

c = #{i G [n] : Xi = -1, yi = +1} . 

In other words, a is the number of players playing the same strategy in profiles x and y, b is 
the number of players playing +1 in x and — 1 in y, and c the number of players playing —1 in 
x and +1 in y. It holds that 

n 

tf(x,y) = ^$(x_i, yi ) 

i=l 

n - kl n - (fc x - 2) 2 n _ (& x + 2 ) 2 

= a h b h c 

2 2 2 

= X - ((a + 6 + c)(n - fc x ) + 4(6 - c)£; x - 4(6 + c)) . (14) 
Now observe that a + 6 + c = n, 2(6 — c) = fc x — ky, and (6 + c) = d x ,y Hence from Q14p we get 

*( X , y) = g H n + k ^ + 2 ^ x ~ h y^ k * ~ 4dx >y) 

k-^k-y ~\~ 2c/x,y ^ • 

□ 

Since the Hamming distance between two profiles is at most n, from the above lemma we get 
the following observation. 

Observation 5.6. Let x, y be two profiles with k x ky ^ 0, then (n — 2)#(x) — ^(x,y) ^ n. 
Now we can give a lower bound on the mixing time by using the bottleneck-ratio technique. 



n n — 2 2 
2 2 x 



Thus 



(n-2)$(x)-*(x,y) 
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Theorem 5.7 (Lower bound). The mixing time of the all-logit dynamics for the Ising model 
on the clique is 

f e /3n(n-2)\ 



4' 



Proof. Let S C {— l,+l} ra be the set of profiles with negative magnetization 

S = {xG {-l,+l} n : A: x <0} 

and observe that tt(S) ^ 1/2. From Observation 15.61 we have that for every x G S and y G S it 
holds that 

7T(X)P(X, y) = ^ e /3[(n-2)*(x)-*(x,y)] ^ ^n /z _ 
Zj 

Moreover, if we name x_ the profile where everyone is playing —1 we have that 

tt(S) ^ vr(x_) ^ I e - 2 ^( x -) = I e ^(n-i) (17) 



Hence, by using bounds (fT6|) and (fTT|) . and the fact that the size of S is at most 2 n , we can 
bound the bottleneck at S with 

R . Q . Q(S,S) Ex € 5E yg 5^(x)P(x,y) 2 2 "- 2 e^ 2 2 "~ 2 
i>('-'J = — 777— = 777; ^ — 



7T(5) 7r(5) ^ e /3n(n-l) e [3n(n-2) " 

By using the bottleneck-ratio theorem (see e.g. Theorem 7.3 in [TTJ) it follows that 



trr\\-x — Q 



2 2n 



□ 



Remarks. In this section we proved upper and lower bounds on the mixing time of the all- 
logit dynamics for the Ising model on the clique. In particular, the upper bound shows that 
for j3 = 0(l/n 2 ) the mixing time is constant and for (3 = C(log n/n 2 ) it is at most polynomial. 
The lower bound shows that, for every constant e > 0, if j3 > (1 + e) (log 4)/n the mixing time 
is exponential. When /3 is between 0(log n/n 2 ) and 0(l/n) we still cannot say if mixing is 
polynomial or exponential. 



6 Conclusions and open problems 

In this paper we studied some properties of the dynamics induced by the logit choice function 
when all players play concurrently and we compared the results with the case of the one-logit 
dynamics. It is well-known that the class of games whose one-logit is reversible is the class of 
potential games. We showed that the class of games such that the all-logit is reversible is a 
natural subclass of potential games that we called social potential games. We also derived the 
first general upper bound on the mixing time of the all-logit and specific upper and lower bound 
for the well-studied Curie- Weiss model. 

In the one-logit dynamics one player is selected uniformly at random at each step while in 
the all-logit one every player updates her strategy at every step. A natural generalization is 
to consider a probability distribution \i over the family of subsets of players, such that at each 
step a subset of players is chosen for the update according to \i. It would be interesting to see 
how stationary distributions and mixing times are affected by fx, and to find out whether there 
is any observable of the dynamics that, when measured at stationarity, does not depend on /j,. 
Some interesting results along that direction have been obtained in [Tj [2]. 
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A A characterization of potential games 

In this section we review a characterization of potential games in terms of the utilities. Let Q be 
a game. A circuit V = (so, . . . , s#) is a sequence of strategy profiles such that sq = se, Sh ^ Sfc 
for 1 h ^ k ^ I and, for k = 1, . . . , £, there exists player such that Sk-i and s& differ only 
for player if,. For such a circuit V we define the utility improvement IiV) as 

e 

H T ) = Kt( s *) - «t*(*fc-i)] • 

k=l 

The following theorem holds. 

Theorem A.l (Monderer and Shapley [14J). A game Q is a potential game if and only if 
1(F) = for all circuits of length 4. 
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